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It is shown that a N x N real symmetric [complex hermi- 
tian] positive definite matrix V is congruent to a diagonal ma¬ 
trix modulo a pseudo-orthogonal [pseudo-unitary] matrix in 
SO(m, n) [ SU (m, n)], for any choice of partition N = m + n. 
It is further shown that the method of proof in this con¬ 
text can easily be adapted to obtain a rather simple proof 
of Williamson’s theorem which states that if N is even then 
V is congruent also to a diagonal matrix modulo a symplectic 
matrix in Sp(N,1Z) [Sp(N,C)]. Applications of these results 
considered include a generalization of the Schweinler-Wigner 
method of ‘orthogonalization based on an extremum principle’ 
to construct pseudo-orthogonal and symplectic bases from a 
given set of linearly independent vectors. 
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I. INTRODUCTION 

It is well known that a IV-dimensional real symmet¬ 
ric [complex hermitian] matrix V is congruent to a di¬ 
agonal matrix modulo an orthogonal [unitary] matrix 
[fl]. That is, V = S^DS where D is diagonal and 
S £ SO(N) [5 £ SU(N)]. If, in addition, V is also 
positive definite, new possibilities arise for establishing 
its congruence to a diagonal matrix. For N even, it was 
shown by Williamson [g| some sixty years ago, and sub¬ 
sequently by several authors H||, that such a V is also 
congruent to a diagonal matrix modulo a symplectic ma¬ 
trix in Sp(N,lZ) [. Sp(N,C )]. That is, V >0 implies 
V = S^D'S where D' is diagonal and S £ Sp(N,lZ) 
[5 £ Sp(N,C)}. Williamson’s theorem has recently been 
exploited in defining quadrature squeezing and symplec- 
tically covariant formulation of the uncertainty principle 
for multimode states ||. In this work we establish yet 
another kind of congruence of a real symmetric [com¬ 
plex hermitian] positive definite matrix to a diagonal ma¬ 
trix valid, for both odd and even dimensions. We show 
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that an IV-dimensional real symmetric [complex hermi¬ 
tian] positive definite matrix V is congruent to a diago¬ 
nal matrix modulo a pseudo-orthogonal [pseudo-unitary] 
matrix. That is, V >0 implies V = S' D"S where D" 
is diagonal and S £ SO(m,n) [5 £ SU(m,n)\, for any 
choice of partition N = m + n. A simple proof of this re¬ 
sult is given. The strategy adopted in proving this result, 
with appropriate modification, works for the Williamson 
case as well, and affords a particularly simple proof of 
Williamson’s theorem. Needless to add that the diagonal 
entries of neither D' nor D" correspond to the eigenval¬ 
ues of V. 

The theorems established here play a crucial role in 
enabling one to construct pseudo-orthogonal and sym¬ 
plectic bases from a given set of linearly independent 
vectors via an extremum principle in the spirit of the 
work of Schweinler and Wigner J6j . In an important con¬ 
tribution to the age old “orthogonalization problem” - 
the problem of constructing an orthonormal set of vec¬ 
tors from a given set of linearly independent vectors - 
Schweinler and Wigner proposed an orthonormal basis 
which, unlike the familiar Gram-Schmidt basis (which 
depends on the particular initial order in which the given 
linearly independent vectors are listed), treats all the lin¬ 
early independent vectors on an equal footing and has 
since found important application in wavelet analysis [[zj . 
More significantly, they showed that this special basis 
follows from an extremum principle. In this work, we 
exploit our results on congruence to obtain generaliza¬ 
tions of the Schweinler-Wigner exremum principle lead¬ 
ing to pseudo-orthogonal and symplectic bases from a 
given set of linearly independent vectors. Conversely, the 
extremum principle, once formulated, can be interpreted 
as a procedure for finding the appropriate congruence 
transformation to effect the desired diagonalization. 

II. CONGRUENCE OF A POSITIVE MATRIX 
UNDER PSEUDO-ORTHOGONAL 
[PSEUDO-UNITARY] TRANSFORMATIONS 

The fact that a real symmetric [complex hermitian] 
matrix is congruent to a diagonal matrix modulo an or¬ 
thogonal [unitary] matrix is well known. While congru¬ 
ence coincides with conjugation in the real orthogonal 
and complex unitary cases, they become distinct when 
more general sets of transformations are involved. A 
question which naturally arises is whether congruence to 
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a diagonal form can also be achieved through a pseudo 
orthogonal [pseudo-unitary] transformation. The answer 
to this question turns out to be in the affirmative with 
the caveat that the matrix in question be positive defi¬ 
nite, and can be formulated as the following theorem: 
Theorem 1: Let V be a real symmetric positive definite 
matrix of dimension N. Then, for any choice of partition 
N = m + n, there exists an S' £ SO(m , n) such that 

S T VS = D 2 = diagonal (and > 0). (1) 

Proof: We begin by recalling that the group SO(m,n) 
consists of all real matrices which satisfy S T gS = 
g, det S = 1, where g =diag( 1,1, • • •, 1, —1, •••,—! ). 

V v ^ V v ✓ 

m n 

Consider the matrix V~ 1 ^ 2 gV~ 1 ^ 2 constructed from the 

given matrix V. Since V~ 1 ^ 2 gV~ 1 ^ 2 is real symmetric, 
there exists a rotation matrix R £ SO(N) which diago¬ 
nalizes V~ x l 2 gV~ 1 / 2 : 

R T V- ll2 gV~ 1/2 R = diagonal = A . (2) 

This may be viewed also as a congruence of g using 
V~ 1 / 2 R, and signatures are preserved under congruence. 
(Indeed, signatures are the only invariants if we allow 
congruence over the full linear group GL(N,1Z) ). As a 
consequence, the diagonal matrix A can be expressed as 
the product of a positive diagonal matrix and g : 

R T V- 1/2 gV~ 1/2 R = D~ 2 g = D-'gD- 1 . (3) 

Here D is diagonal and positive definite. 

Taking the inverse of the matrices on both sides of (3) 
we find that the diagonal entries of gD 2 = D 2 g are the 
eigenvalues of V 1 ^ 2 gV 1 ^ 2 and that the columns of R are 
the eigenvectors of V 1 ^ 2 gV 1 ^ 2 . Since V 1 ^ 2 gV 1 ^ 2 , gV, and 
Vg are conjugate to one another, we conclude that D 2 is 
determined by the eigenvalues of gV ~ Vg. 

Define S = V~ 1 ^ 2 RD. It may be verified that S satis¬ 
fies the following two equations : 

S T gS = g, 

S T VS = D 2 = diagonal. (4) 

The first equation says that S £ SO(m, n) and the second 
says that V is diagonalized through congruence by S. 
Hence the proof. 

group SO(m,n) by SU(m,n), and R £ SO(N ) by 
U £ SU(N) in the statement and proof of the above 
theorem, we have the following theorem which applies to 
the complex case. 

Theorem 2: Let V be a hermitian positive definite matrix 
of dimension N. Then, for any partition N = m+n, there 
exists an S' £ SU(m,n) such that 

S^VS = D 2 = diagonal (and > 0). (5) 


III. A SIMPLE PROOF OF WILLIAMSON’S 
THEOREM 


It turns out that the above procedure when applied to 
the real symplectic group of linear canonical transforma¬ 
tions leads a particularly simple proof of Williamsons’s 
theorem. 

Theorem 3: Let V be a 2n-dimensional real symmet¬ 
ric positive definite matrix. Then there exists an S £ 
Sp(2n, TV) such that 

S T VS = D 2 >0, 

D 2 = diag(/vi, k 2 , • ■ •, Kn, « l, «2, ■ • •, ««)• (6) 

Proof: Note that the 2n-dimensional diagonal matrix D 
has only n independent entries. The group Sp(2n,TV) 
consits of all real matrices S which obey the condition 

S T ftS = (3 , /3=(_° 10 1 ) i (7) 


with 1 and 0 denoting the n x n unit and zero matrices 
respectively. Even though S T (3S = /3 may appear to 
suggest that detS = ±1, it turns out that detS = 1. 
In other words, Sp(2n,TZ) consists of just one connected 
(though not simply connected) piece. Indeed, for every 
n > 1 the connectivity property of Sp(2n, 7V) is the same 
as that of the circle. 

The most general S £ GL(2n,TZ) which solves 
S T VS = D 2 is S = V~ 1 ^ 2 RD, where R £ 0(2n). Note 
that none of the factors D,R or V -1 / 2 is an element of 
Sp(2n,lZ). However, a H-dependent choice of D,R can 
be so made that the product V~ l ^ 2 RD is an element of 
Sp(2n,lZ) as we shall now show. 

Since /? T = — (3, it follows that M = V~ 1 ^ 2 pV~ 1 ^ 2 is 
antisymmetric. Hence there exists an R £ SO(2n) such 
that H 

r t v- 1 / 2 pv~ 1/2 r = ( _° n o ) ’ n = dia S onal > 0 • 


Define a diagonal positive definite matrix 


D = 


n -1 / 2 o \ 

o q -1 / 2 ) ' 


(9) 


Then we have 

DR T V- 1/2 f3V~ 1/2 RD = p . (10) 

Now define S = V~ 1 / 2 RD. It may be verified that S 
enjoys the following properties: 

S t / 3S = p , 

S T VS = D 2 = diagonal. (11) 
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The first equation says that S £ Sp(2n ,, TV) and the 
second one says that V is diagonalized by congruence 




through S. This completes the proof of the Willianson 
theorem. To appreciate the simplicity of the present the 
reader may like to compare it with two recently published 
proofs of the Williamson theorem (J] . 

We wish to explore the structure underlying the above 
proof a little further so that the relationship between 
D and S in (11) on the one hand and the eigenvalues 
and eigenvectors of (3V~ l (or 1//2 /3W —1 / 2 ) on the other 
becomes transparent. Again consider the matrix Ad = 
V~ 1 ^(3V- 1 / 2 . It is a real, non-singular, anti-symmetric 
matrix and hence its eigenvalues iu) a and eigenvectors rj a 
have the following properties: 

Mr/ a = iu! a r] a , a = l, ■••,2 n\ 

LOk > 0, k = l, ton+k = -Wfc ; 

Vn+k = ril\ k = 1, • • •, n. (12) 

The eigenvectors rj a can be chosen to be orthonormal 
even when the eigenvalues iuj a are degenerate. Arrange 
the eigenvectors r] a as columns of a matrix U. The ma¬ 
trix U thus obtained clearly belongs to the unitary group 
U(2n), and satisfies 

U<MU = A, A=(®_° n ), (13) 


where = diag(uq, ■ • • ,w n ) > 0. Now define the follow¬ 
ing 2 n x 2n unitary matrices 


E = 


0 1 
1 0 




(14) 


These two matrices have the properties E 2 = 1, t/E = 
U*, and EA = A* (* denotes complex cojugate of a ma¬ 
trix). As a useful consequence of these properties we 
have 


U*A* = LTEEA* = U A. (15) 


We find that the unitary matrix U A is real: U A £ 0{2n). 

Now consider S = V~ 1 ^ 2 U AD, where I? is a diagonal 
matrix to be determined. It follows from the definition 
of S and the reality of UA £ 0(2n) that 

S T VS = S' 1 'VS = D 2 . (16) 

Further, recalling that MU = A we obtain 

S t /3S = S'/3S = DAW MU AD 

= MUAD = T^ ° n ^ ^JD. (17) 

It is now evident that the following choice for D ensures 
that S is an element of S £ Sp(2n,1Z): 


D = 


n - 1 / 2 o \ 

o s I - 1 / 2 J • 


(18) 


This completes our analysis of the manner in which S 
and D are related to the eigenvalues and eigenvectors of 
the matrix pV~ l . 


As in the pseudo-orthogonal case, by replacing the su- 
percript T by f in the statement and proof of Theorem 
3, one obtains the following result. 

Theorem 4: Let V be a 2n-dimensional hermitian positive 
definite matrix. Then there exists an S £ Sp(2n,C) such 
that 

&VS = D 2 > 0, 

D 2 = diag(«;i, k 2 , ■■■,««, ki, « 2 , ■ • •, «„). (19) 

An immediate consequence of the theorems stated 
above is that for a real symmetric [complex hermitian] 
positive definite matrix we can not talk about the canon¬ 
ical form under congruence, for there are m + n possible 
choices of SO(m,n) [SU(m,n)\, and in the case of even 
dimension one more choice coming from Williamson’s 
theorem. Needless to add that for the same matrix V, the 
diagonal matrix D will be different for different choices. 


IV. ORTHOGONALZATION PROCEDURES 

Assume that we are given a set of linearly indepen¬ 
dent V-dimensional vectors v\, ■ ■ ■, vn- Let G denote 
the associated Gram matrix of pairwise inner products: 
Gij = ( Vi,Vj ). The Gram matrix is hermitian by con¬ 
struction, and positive definite by virtue of the linear 
independence of the given vectors. The orthogonaliza- 
tion problem, i.e., constructing a set of orthonormal vec¬ 
tors out of the given set of linearly independent vectors, 
amounts to finding a matrix S that solves 

S^GS = 1, i.e., G- 1 = SS^. (20) 

Each such S defines an orthogonalization procedure. 

Let us arrange the set of N vectors as the entries of a 
row v = (rq, r» 2 , • • •, Vn), and let z = (z,\, • • •, Zn) rep¬ 

resent a generic orthonormal basis. The orthonormal set 
of vectors z corresponding to a chosen S are related to the 
given set of linearly independent vectors through z = vS. 
Clearly, there are infinitely many choices for S satisfying 
(20): given an S satisfying (20), any S' = SU where 
U is an arbitrary unitary matrix also satifies (20). Thus 
the freedom available for the solution of the orthonormal¬ 
ization problem is exactly as large as the unitary group 
U(N), and this was to be expected. 

Schweinler and Wigner Q posed and answered the fol¬ 
lowing question: is there a way of descriminating be¬ 
tween various choices of S that solves (20) and hence 
between various orthogonalization procedures? They ar¬ 
gued that a particular choice of orthogonalization pro¬ 
cedure should correspond ultimately to the extremiza- 
tion of a suitable scalar function over the manifold of all 
orthonormal bases, with the given linearly independent 
vectors appearing as parameters in the function. Differ¬ 
ent choices of onthonormal bases will then correspond 
to different functions to be extremized. They preferred 
the function to be symmetric under permutation of the 
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given vectors. As an example they considered the follow¬ 
ing function which is quartic in the given vectors: 

gm(z) = E ^E I (zk,vi) | 2 ^j . (21) 

They showed that the extremum (maximum in this case) 
value of m( z) is given by tr(G 2 ), and this value corre¬ 
sponds to the orthonormal basis z = vUqP~ 1 ^ 2 , where Uq 
is the unitary matrix which diagonalizes G: U^GUq = P. 
We may refer to this as the Schweinler-Wigner basis, and 
the function m{ z) as the Schweinler-Wigner quartic form. 
It is clear that Uq and hence the Schweinler- Wigner ba¬ 
sis is essentially unique if the eigenvalues of the Gram 
matrix G are all distinct. We may note in passing that, 
unlike the Gram-Schmidt orthogonalization procedure, 
the Schweinler-Wigner procedure is democratic in that it 
treats all the linearly independent vectors v on an equal 
footing. 

The content of the work of Schweinler and Wigner has 
recently been reformulated || in a manner that offers 
a clearer and more general picture of the Schweinler- 
Wigner quartic form m(z) and of the orthonormal basis 
which maximizes it. This perspective on the orthogo¬ 
nalization problem plays an important role in our gener¬ 
alizations of the Schweinler-Wigner extremum principle, 
and hence we summarise it briefly. 

Since every orthonormal basis is the eigenbasis of a 
suitable hermitian operator, it is of interest to character¬ 
ize the Schweinler-Wigner basis in terms of such an oper¬ 
ator. Given linearly independent iV-dimensional vectors 
v = (vi,V 2 , ■ • ■, Vn), the operator M = hermi- 

3 

tian positive definite. In a generic orthonormal basis z, 
it is represented by a hermitian positive definite matrix 
M( z) : = (. Zi,Mzj ). Under a change of orthon- 

armal basis z —> z' = z5, M( z) transforms as follows 

M(z) -» M(z') = S^M(z)S , S e U(N) . (22) 

Recall that U(N) acts transitively on the set of all or¬ 
thonormal bases and that tr(M(z ) 2 ) = Y M ( z ^ k \ 2 is 

j,k 

invariant under such a change of basis, and hence is en- 
dependent of z. The Schweinler-Wigner quartic form 
m(z) can easily be identified as ^(M(z)fcfc) 2 . In view 

k 

of the above invariance, maximization of ^~^(Af (z)kk) 2 is 

k 

the same as minimization of Y^\M (z) jk\ 2 ■ The absolute 

jVfc 

minimum of E^l M (z )jk\ 2 equals zero, and obtains when 

M{ z) is diagonal. Thus, the orthonormal basis which 
maximizes E( M ( z ) kk) 2 is the same as the one in which 
k 


M is diagonal, and we arrive at the following important 
conclusion of Ref. [9]: 

Theorem 5: The distinquished orthonormal basis which 
extremizes the Schweinler-Wigner quartic form m( z) over 
the manifold of all orthonormal bases is the same as the 
orthonormal basis in which the positive definite matrix 
M{ z) becomes diagonal. 

Important for the above structure is the fact that the 
invariant tr(M(z) 2 ) is the sum of non-negative quanti¬ 
ties, and therefore a part of it is necessarily bounded. It 
is precisely this property, which can be traced to the un¬ 
derlying unitary symmetry, that is not available when we 
try to generalize the Schweinler-Wigner procedure to con¬ 
struct pseudo-orthonormal and symplectic bases wherein 
the underlying symmetries are the noncompact groups 
SO(m,n) and Sp(2n,1Z) respectively.. 

V. LORENTZ BASIS WITH AN EXTREMUM 
PROPERTY 

In this Section we show how the Schweinler-Wigner 
procedure can be generalized to construct pseudo¬ 
orthonormal basis based on an extremum principle. We 
begin with the case of real vectors. 

We are given a set of linearly independent real N- 
dimensional vectors v = (tq, ■ ■ ■, Vn) and we want to 
construct out of it a pseudo-orthonormal basis [, SO(m , n) 
Lorentz basis with N = m + n], i.e., a set of vectors 
z = (zi, ■ ■ ■, z N ) satisfying 

{z k ,gzi)= g k i, g = diag( 1,1, • • •, 1, —1, • • •, —1). (23) 

N v ✓ S. v ✓ 

m n 

Let M = ’Y^VjvJ as before, and let the symmetric pos- 

3 

itive definite matrix M(z) : M(z)y = ( Zi,Mzj ) repre¬ 
sent M in a generic pseudo-orthonormal basis z. Under 
a pseudo-orthogonal change of basis z —> z' = zS, the 
matrix M(z) transforms as follows: 

M(z) -> M(z') = S t M(z)S , S e SO{m, n ). (24) 

Since S T gS = g (or gS T = S~ x g) by definition, we have 

S : gM( z) gM( z') = S~ 1 gM{z)S. (25) 

That is, as M(z) undergoes congruence, gM{z) under¬ 
goes conjugation. Thus, tr(gM(z)) z , l = 1,2,---, are 
invariant. In what follows we shall often leave implicit 
the dependence of M on the generic pseudo-orthonormal 
basis z. 

Consider the invariant tr(gM(z)gM(z)) corresponding 
to l = 2. Write M = M even + M odd where 

M even = \{M + gMg ), M odd = - gMg). (26) 

In the above decomposition we have exploited the fact 
that g is, like parity, an involution. 
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With M expressed in the (to, n) block form 

M= ( pT ° B ) , ^L T =^L, B t = B, (27) 
we have 

M even = ( g ° ) , M° dd =( C ° T Jj. ( 28 ) 

Symmetry of M implies that M odd and M even are sym¬ 
metric. Further, M odd and M even are trace orthogonal: 
tr(M odd M even ) = 0. Thus, 

tr (gMgM) = tr(M even ) 2 - tr(M odd ) 2 , (29) 

which can also be written as 

tr (MgMg) = tr(M 2 ) - 2tr(M odd ) 2 . (30) 

A few observations are in order: 

• In contradistinction to the original unitary case, 
the invariant in the present case is no more a 
sum of squares. This can be traced to the 
non-compactness of the underlying SO(m,n) sym¬ 
metry. As one consequence, y^(Mfcfc) 2 is not 

k 

bounded. As an example, consider the simplest 
case to = 1, n = 1 and let 

M= ( o , a,6> 0. (31) 


by theorem 1 there exists a Lorentz basis z in which 
M( z) is diagonal and hence M(z) oc ^ = 0. Thus 
the minimum tr(M(z) odd ) 2 = 0, and hence the 
minimum of tr(M(z) 2 ), namely tr(gM(z)gM(z )), 
is attainable. 

The above observations suggest the following two step 
analogue of the Schweinler-Wigner extremum principle 
for Lorentz bases. Choose the submanifold of Lorentz 
bases which minimize the quartic form tr(M(z) odd ) 2 , and 
maximize the Schweinler-Wigner quartic form to(z) = 
^^(M(z)fcfc) 2 within this submanifold. Clearly, the first 
k 

step takes M to a block-diagonal form, and the sec¬ 
ond one diagonalizes it. Thus we have established the 
following generalization of Theorem 5 to the pseudo¬ 
orthonormal case: 

Theorem 6: The distinquished pseudo-orthonormal basis 
which extremizes the “Schweinler-Wigner” quartic form 
inly z) over the submanifold of pseudo-orthonormal bases 
which minimize the quartic form tr(M(z) 2 ) is the same 
as the pseudo-orthonornral basis in which the positive 
definite matrix M( z) becomes diagonal. 

The submanifold under reference consists of Lorentz 
bases which are related to one another through the max¬ 
imal compact (connected) subgroup of SO(m, n), namely 
SO{m) x SO{n). This subgroup consists of matrices of 
the block-diagonal form 

( H ) , Rie 50(to) , Fh G SO(n) , (33) 


Under congruence by the 50(1,1) element 

5 =f cosh/x sinh M \ 

y sinn fi cosh /i J v 7 

the value of ^(M*,*,) 2 changes from a 2 + b 2 to 
k 

a 2 + b 2 + 2a6sinh 2 /rcosh 2 p, which grows with 
p without bounds, showing that ^ ~2(M kk ) 2 and 

k 

lienee tr(M 2 ) is not bounded. Thus, in contrast to 
the unitary case, extremization of the Schweinler- 
Wigner quartic form ^^( Mkk ) 2 will make no sense 
k 

in the absence of further restrictions. 

• The structure of the invariant tr (gMgM) in 
(30) suggests the further restriction needed to 
be imposed: within the submanifold of pseudo- 
orthogonal bases z which keep tr(M(z) odd ) 2 (and 
hence tr(M(z) 2 )) at a fixed value we can maximize 
^^Af(z) 2 fc . In particular we can do this within the 
k 

submanifold which minimizes tr(M(z) odd ) 2 , and 
hence tr(M(z) 2 ). Clearly, zero is the absolute min¬ 
imum of the nonnegative object tr(M(z) odd ) 2 . But 


and this is precisely the subgroup of SO(m,n) transfor¬ 
mations that do not mix the even and odd parts of M (z). 

To conclude this Section we may note that the above 
construction carries over to the complex case, with ob¬ 
vious changes like replacing T by ' and SO(m,n) by 
SU(m , n). 

VI. SYMPLETIC BASIS WITH AN EXTREMUM 
PROPERTY 

Our construction in the pseudo-orthogonal case sug¬ 
gests a scheme by which the Schweinler-Wigner ex¬ 
tremum principle principle can be generalized to con¬ 
struct a symplectic basis. Suppose that we are given 
a set of linearly independent vectors v = (tq, i>2, • • •, tq n ) 
in lZ 2n . The natural symplectic structure in R 2n is spec¬ 
ified by the standard symplectic “metric” /3 defined in 
(7). Let z = (zi, Z 2 , ■ ■ ■, Z 2 n) denote a generic symplectic 
basis. That is, ( Zj,/3zk ) = f3jk , j, k = 1, 2, • • •, 2 n. The 
real symlectic group Sp(2n, R) acts transitively on the 
set of all symplectic bases. 

To generalize the Schweinler-Wigner principle to the 

2n 

symplectic case, we begin be defining M = ^^VjvJ. Let 

j= i 
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M( z) : M(z)ij = ( Zi,Mzj ) be the symmetric positive 
definite matrix representing the operator M in a generic 
symplectic basis z. Under a symplectic change of basis 
z —> zl = zS, S G Sp(2n,TZ), the matrix M(z) undergoes 
the following transformation: 


M{ z) 

-► M(z') = S t M(z)S , S G Sp{2n, R). 

(34) 

Since S T 

/3S = (3 implies (3S T = S~ 1 f3, we have 


S 

: (3M(z) -> /3M(z') = S~ 1 pM(z)S. 

(35) 


That is, under a symplectic change of basis M (z) un¬ 
dergoes congruence, but /3M( z) undergoes conjugation. 
Hence tr(/3M(z)) , l = 1,2, - - •, n are invariant (Note 
that tr(/3M(z)) 2i+1 = 0 in view of /3 T = — /?, M(z) T = 

M( z))- 

Since i/3 is an involution we can use it to separate M (z) 
into even and odd parts : 


The structural similarity of this invariant to that in the 
pseudo-orthogonal case should be appreciated. 

Now, by an argument similar to the pseudo-orthogonal 
case one finds that, owing to the noncompactness 
of Sp(2n,lZ), the function tr(M(z) 2 ) and hence the 

2 n 

Schweinler-Wigner quartic form ]T(M(z) kk) 2 is un- 

k =1 

bounded if z is allowed to run over the entire manifold of 
all symplectic bases. For instance, in the lowest dimen¬ 
sional case n = 1 with M chosen to be 

M = | ^ j , a, b > 0, ab — ud > 0, (42) 

under congruence by the Sp( 2,, 7Z) matrix 

s =(o 1/p)' < 43 > 


M(z) = M (z) even + M(z) odd , 

M(z) even = i(M(z)+/3M(z)/3 T ), 

M(z) odd = l(Af(z) - /3M(z)/ 3 t ) . (36) 

The even and odd parts of M(z) satisfy the symmetry 
properties 

/3M(z) even /3 T = Af(z) even , /3M(z) odd /3 T = -M(z) odd . 

(37) 

Further, M(z) odd and M(z) even are trace orthogonal: 
tr (M(z) odd M(z) even ) = 0. 

The structure of the even and odd parts of M(z) may 
be appreciated by writing M(z) in the block form 


M( z)= ( ), = lU = B . 

We have 


M(z) even = 


M (z) odd = 


i(ri + H) \{C-C T ) 

-\{C-C T ) \{A + B) 
h(A-B) i (C + C T ) 

I (C + C T ) l(B-A) 


(38) 


(39) 


Now consider the invariant — tr(/3M(z)/3M(z)) = 

tr(/3 T M(z)/3M(z)). We have 

tr(/3 T M(z)/3M(z)) = tr(M(z) even ) 2 - tr(M(z) odd ) 2 , 

(40) 


the value of ^(M^) 2 changes from a 2 + b 2 to p 2 a 2 + 
k 

(1 /pi 2 )b 2 which, by an appropriate choice of /i, can be 
made as large as one wishes. 

However, it follows from (41) that over the submanifold 
of symplectic bases which leave tr(M(z) odd ) 2 fixed, the 
function tr(M(z) 2 ) remains invariant and so the quar¬ 
tic form ]T(M(z)/cfc) 2 is bounded within this restricted 
class of symplectic bases and hence can be maximised. 
In particular the nonnegative tr(M(z) odd ) 2 can be chosen 
to take its minimum value. Williamson theorem implies 
that there are symplectic bases which realize the absolute 
mimumum tr(M(z) odd ) 2 = 0. 

We can now formulate the analogue of the Scweinler- 
Wigner extremum principle for symplectic bases in the 
following way: Take the subfamily of symplectic bases 
in which tr(M(z) odd ) 2 and hence tr(M(z) 2 )is mini¬ 
mum. [This minimum of tr(M(z) 2 ) equals the invari¬ 
ant tr(/3 T M(z)/3M(z))]. Then maximise the Schweinler- 
Wigner quartic form m(z) = ^^(M(z)fcfc) 2 within this 

k 

submanifold of symplectic bases. This will lead, not 
just to a basis in which M(z) is diagonal, but to one 
where M (z) has the Williamson canonical form M (z) = 
diag(«i, ■ ■ ■, K n ; «i, ■ • ■, K n ). We have thus established 
the following generalization of the Schweinler-Wigner ex¬ 
tremum principle to the symplectic case. 

Theorem 7: The distinquished symplectic basis which ex- 
tremizes the “Schweinler-Wigner” quartic form m( z) over 
the submanifold of symplectic bases which minimize the 
quartic form tr(M(z) 2 ) is the same as the symplectic ba¬ 
sis in which the positive definite matrix M (z) assumes 
the Williamson canonical diagonal. 

Note that once M(z) odd = 0 is reached, as implied by 
tr(M(z) odd ) 2 = 0, M(z) has the special even form 


which can also be written as 


(_ A C 5)’ AT = A ’ cT = ~ c - ( 44 ) 


tr(/3 T M(z)/3M(z)) = tr(M(z) 2 ) - 2tr(M(z) ocW ) 2 . (41) 
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so that A + iC is hermitian. The subgroup of sym- 
plectic transformations which do not mix M(z) even with 
M(z) odd , and hence maintain the property M(z) odd = 0 
have the special form 

X + iYeU(n). (45) 

This subgroup, isomorphic to the unitary group U(n), is 
the maximal compact subgroup |uj of Sp(2n,lZ). Thus, 
diagonalizing M( z) using symplectic change of basis, af¬ 
ter it has reached the even form, is the same as diagonal¬ 
izing an n-dimensional hermitian matrix using unitary 
transformations. 


VII. CONCLUDING REMARKS 

To conclude, we have shown that an N x N 
real symmetric [complex hermitian] positive definite 
matrix is congruent to a diagonal form modulo a 
pseudo-orthogonal [pseudo-unitary] matrix belonging to 
SO(m,n ) [SU(m,n)], for any choice of partition N = 
m + n. The method of proof of this result is 
adapted to provide a simple proof of Williamson’s the¬ 
orem. An important consequence of these theorems 
is that while a real-symmetric [complex-hermitian] pos¬ 
itive definite matrix has a unique diagnal form un¬ 
der conjugation, it has several different canonical di¬ 
agnal forms under congruence. The theorems devel¬ 
oped here are used to formulate an extremum princi¬ 
ple a la Schweinler and Wigner for constructing pseudo¬ 
orthonormal [pseudo-unitary] and symplectic bases from 
a given set of linearly independent vectors. Conversely, 
the extremum principle thus formulated can be used 
for finding the congruence transformation which brings 
about the desired diagonalization. 

It is interesting that pseudo-orthonormal basis and 
symplectic basis could be constructed by extremizing pre¬ 
cisely the same Schweinler-Wigner quartic form m( z) = 
YXM(z) kk f that was originally used to construct or- 
k 

thonormal basis in the unitary case. However, it must 
be borne in mind that the similarity in the structure of 
the quartic form to be extremized in the three cases con¬ 
sidered is only at a formal level. In reality, the three 
quartic forms are very different objects, for they are 
functions over topologically very different manifolds: z 
runs over the group manifold U(N) of orthogonal frames 
in the original Schweinler-Wigner case, the group mani¬ 
fold SO{m , n) of pseudo-orthogonal frames in the Lorentz 
case, and over the group manifold Sp(2n, 1Z) in the sym¬ 
plectic case. This has the consequence that, unlike the or¬ 
thogonal case, this quartic form is unbounded in the non¬ 
compact SO(m,n)[SU(m,n)] and Sp(2n,lZ)[Sp(2n,C)\ 
cases. Insight into the structure of these groups was 
used to achieve constrained extremization within a nat¬ 
ural maximal compact submanifold. 
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